In defense of statistical methods for detecting positive selection.

نویسندگان

  • Ziheng Yang
  • Rasmus Nielsen
  • Nick Goldman
چکیده

In a highly publicized article, Nozawa et al. (1) claimed that the branch-site model (BSM) (2, 3) was unreliable because it produced excessive false positives in their simulation experiment. BSM uses a likelihood ratio test to detect positive selection that affects particular branches and codons in proteincoding genes, indicated by accelerated nonsynonymous substitution rates. The authors’ conclusion, if true, would be important. But it is contradicted by their simulation results. The study generated 14,000 datasets under a null model that postulated no positive selection and found that BSM falsely detected positive selection in 32 cases. Nozawa et al. (1) claimed that those false positives were ‘‘not supposed to be obtained theoretically’’ and indicated ‘‘abnormal behaviors’’ of the likelihood ratio test. Those claims are false: the false-positive rate is only 0.23% (32 of 14,000), much lower than the nominal significance level (5%). Contrary to Nozawa et al.’s claims, the test is thus conservative. Nozawa et al. preferred a parsimony-based approach, which averages rates over the whole protein and achieved 0% false-positive rate in their simulation. The authors did not examine the power of the tests. In previous simulations (4), such parsimony-based methods were found to have little power, even when the likelihood ratio tests detected positive selection with 100% power. We suggest that sensible use of statistical methods for detecting positive selection such as BSM (5) is valuable in comparative analysis of genomic data. They can generate biological hypotheses for experimental verification, narrowing down possibilities for test in the laboratory. Nozawa et al.’s results, interpreted correctly, support this view, as do many studies in which the statistical predictions were validated in the laboratory.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical methods for detecting molecular adaptation.

The past few years have seen the development of powerful statistical methods for detecting adaptive molecular evolution. These methods compare synonymous and nonsynonymous substitution rates in protein-coding genes, and regard a nonsynonymous rate elevated above the synonymous rate as evidence for darwinian selection. Numerous cases of molecular adaptation are being identified in various system...

متن کامل

A Novel Architecture for Detecting Phishing Webpages using Cost-based Feature Selection

Phishing is one of the luring techniques used to exploit personal information. A phishing webpage detection system (PWDS) extracts features to determine whether it is a phishing webpage or not. Selecting appropriate features improves the performance of PWDS. Performance criteria are detection accuracy and system response time. The major time consumed by PWDS arises from feature extraction that ...

متن کامل

The mediator role of alexithymia between ego defense styles and the intensity of pain in patients with chronic pain

 Aims and background: Chronic pain, is one of the psychosomatic disease that a lot of people meet throughout their life. The aim of this study was to determine the mediation role of alexithymia between ego defense styles and the intensity of pain that percipience by the patients with chronic pain. Materials and methods: The method of the study was the correlation, and the statistical universe w...

متن کامل

A Multiple Objective Nonlinear Programming Model for Site Selection of the Facilities Based on the Passive Defense Principles

One of the main principles of the passive defense is the principle of site selection. In this paper, we propose a multiple objective nonlinear programming model that considers the principle of the site selection in terms of two qualitative and quantitative aspects. The purpose of the proposed model is selection of the place of facilities of a system in which not only it observes the dispersion ...

متن کامل

Accuracy and power of statistical methods for detecting adaptive evolution in protein coding sequences and for identifying positively selected sites.

The parsimony method of Suzuki and Gojobori (1999) and the maximum likelihood method developed from the work of Nielsen and Yang (1998) are two widely used methods for detecting positive selection in homologous protein coding sequences. Both methods consider an excess of nonsynonymous (replacement) substitutions as evidence for positive selection. Previously published simulation studies compari...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Proceedings of the National Academy of Sciences of the United States of America

دوره 106 36  شماره 

صفحات  -

تاریخ انتشار 2009